Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome

نویسندگان

  • Umadevi Paila
  • Rohini Kondam
  • Akash Ranjan
چکیده

The genomic era has seen a remarkable increase in the number of genomes being sequenced and annotated. Nonetheless, annotation remains a serious challenge for compositionally biased genomes. For the preliminary annotation, popular nucleotide and protein comparison methods such as BLAST are widely employed. These methods make use of matrices to score alignments such as the amino acid substitution matrices. Since a nucleotide bias leads to an overall bias in the amino acid composition of proteins, it is possible that a genome with nucleotide bias may have introduced atypical amino acid substitutions in its proteome. Consequently, standard matrices fail to perform well in sequence analysis of these genomes. To address this issue, we examined the amino acid substitution in the AT-rich genome of Plasmodium falciparum, chosen as a reference and reconstituted a substitution matrix in the genome's context. The matrix was used to generate protein sequence alignments for the parasite proteins that improved across the functional regions. We attribute this to the consistency that may have been achieved amid the target and background frequencies calculated exclusively in our study. This study has important implications on annotation of proteins that are of experimental interest but give poor sequence alignments with standard conventional matrices.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The compositional adjustment of amino acid substitution matrices.

Amino acid substitution matrices are central to protein-comparison methods. In most commonly used matrices, the substitution scores take a log-odds form, involving the ratio of "target" to "background" frequencies derived from large, carefully curated sets of protein alignments. However, such matrices often are used to compare protein sequences with amino acid compositions that differ markedly ...

متن کامل

Evaluating the efficacy of a structure-derived amino acid substitution matrix in detecting protein homologs by BLAST and PSI-BLAST

The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect si...

متن کامل

Substitution of soybean with canola meal in laying hens diets formulated based on total and digestible amino acids on performance and blood parameters

An experiment was conducted to study the effects of substitution soybean meal (SBM) with canola meal (CM) and formulated diets based on total and digestible amino acid on performance, egg quality, organs weight and blood parameters of laying hens from 73 to 83 weeks of age. A total of 128 laying hens were distributed by completely randomized design in a 2×2 factorial arrangement with 2 protein ...

متن کامل

Isolation and Characterization of a New Peroxisome Deficient CHO Mutant Cell Belonging to Complementation Group 12

We searched for novel Chinese hamster ovary (CHO) cell mutants defective in peroxisome biogenesis by an improved method using peroxisome targeting sequence (PTS) of Pex3p (amino acid residues 1–40)-fused enhanced green fluorescent protein (EGFP). From mutagenized TKaEG3(1–40) cells, the wild-type CHO-K1 stably expressing rat Pex2p and of rat Pex3p(1–40)-EGFP, numerous cell colonies resistant to...

متن کامل

Insights from the analysis of conserved motifs and permitted amino acid exchanges in the human, the fly and the worm GPCR clusters

G-protein coupled receptors (GPCRs) belong to biologically important and functionally diverse and largest super family of membrane proteins. GPCRs retain a characteristic membrane topology of seven alpha helices with three intracellular, three extracellular loops and flanking N' and C' terminal residues. Subtle differences do exist in the helix boundaries (TM-domain), loop lengths, sequence fea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 36  شماره 

صفحات  -

تاریخ انتشار 2008